Automatic Algorithm Development Using New Reinforcement Programming Techniques

Authors

  • Spencer K. White
  • Tony R. Martinez
  • George L. Rudolph
Abstract

ion in reinforcement learning. Artificial Intelligence, 112(1-2): 181–211.
URBANOWICZ, R. J., and J. H. MOORE. 2009. Learning classifier systems: A complete introduction, review, and roadmap. Journal of Artificial Evolution and Applications, 2009. DOI: 10.1155/2009/736398.
WATKINS, C. J. 1989. Learning from delayed rewards. Ph.D. thesis, Cambridge University, Cambridge, UK.
WHITE, S., T. R. MARTINEZ, and G. RUDOLPH. 2010. Generating three binary addition algorithms using reinforcement programming. In Proceedings of the 48th Annual Southeast Regional Conference (ACMSE ’10). ACM Press: New York. DOI: 10.1145/1900008.1900072. http://doi.acm.org/10.1145/1900008.1900072
WHITE, S. K. 2006. Reinforcement Programming: A New Technique in Automatic Algorithm Development. Master’s thesis, Brigham Young University, Provo, UT.
WHITESON, S., and P. STONE. 2006. Evolutionary function approximation for reinforcement learning. Journal of Machine Learning Research, 7: 877–917.
XU, X., D. HU, and X. LU. 2007. Kernel-based least squares policy iteration for reinforcement learning. IEEE Transactions on Neural Networks, 18(4): 973–992.

Similar resources

Shuffled Frog-Leaping Programming for Solving Regression Problems

There are various automatic programming models inspired by evolutionary computation techniques. Because an automatic mechanism is needed to explore the complicated search spaces of mathematical problems where numerical methods fail, evolutionary computation is widely studied and applied to real-world problems. One of the famous algorithms for optimization problems is shuffl...

Automatic Programming of Behavior-based Robots using Reinforcement Learning. Sridhar Mahadevan and Jonathan Connell

This paper describes a general approach for automatically programming a behavior-based robot. New behaviors are learned by trial and error using a performance feedback function as reinforcement. Two algorithms for behavior learning are described that combine techniques for propagating reinforcement values temporally across actions and spatially across states. A behavior-based robot called OBELI...

Network Planning Using Iterative Improvement Methods and Heuristic Techniques

The problem of minimum-cost expansion of a power transmission network is formulated for solution by a genetic algorithm, with the cost of new lines, security constraints, and Kirchhoff’s law at each bus bar included. A genetic algorithm (GA) is a search or optimization algorithm based on the mechanics of natural selection and genetics. An applied example is presented. The results from a set of tests carried ...

Genetic Encoding of Agent Behavioral Strategy

This paper addresses the automatic generation of intelligent collective behaviors using genetic programming and reinforcement learning. We define a behavior-based system that relies on an automatic design process, using artificial evolution to synthesize high-level behaviors for autonomous agents. Behavioral strategies are described by tree-based structures, and manipulated by...

Journal title:
  • Computational Intelligence

Volume 28  Issue 

Pages  -

Publication date 2012